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GenCore version 6.2.1 
Copyright (c) 1993 - 2008 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 



April 10, 2008, 00:54:56 ; Search time 669 Seconds 
(without alignments) 

14410.097 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-529-592A-3 

893 

1 ccgcgggaggcgcgcggctg. 



. aaataaagatcctctgtaac 893 



Scoring table: 



IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 9073515 seqs, 5397694045 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 10 0% 
Listing first 45 summaries 



Database : N_Geneseq_2 0 0 711 : * 

1: geneseqnl980s : * 

2: geneseqnl990s : * 

3: geneseqn2000 : * 

4: geneseqn2001a : * 

5: geneseqn2001b : * 

6: geneseqn2002a : * 

7: geneseqn2002b : * 

8: geneseqn2003a : * 
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9: geneseqn2003b : * 

10: geneseqn2003c : * 

11: geneseqn2003d: * 

12: geneseqn2004a: * 

13: geneseqn2004b: * 

14: geneseqn2004c : * 

15: geneseqn2004d: * 

16: geneseqn2005a : * 

17: geneseqn2005b: * 

18: geneseqn2005c : * 

19: geneseqn2006a : * 

20: geneseqn2006b: * 

21: geneseqn2006c : * 

22: geneseqn2007 : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result Query 





No. 


Score 


Match 


Length 


DB 


ID 


Description 




1 


893 


100 


0 


893 


12 


ADM96969 


Adm96 96 9 Human pan 




2 


893 


100 


0 


893 


22 


AES72473 


Aes72473 Human C19 




3 


827 


92 


6 


881 


12 


ADM96967 


Adm96 96 7 Human pan 




4 


827 


92 


6 


881 


22 


AES72471 


Aes72471 Human C19 


c 


5 


671 


75 


1 


963 


4 


AAI90742 


Aai90742 Human pol 


c 


6 


671 


75 


1 


963 


9 


ADE09694 


Ade0 96 9 4 Novel DNA 




7 


662.2 


74 


2 


761 


4 


AAK88446 


Aak88446 Human dig 


c 


8 


611 


68 


4 


614 


13 


ADR26 6 70 


Adr26670 Breast ca 




9 


549 


61 


5 


2798 


4 


AAK90422 


Aak90422 Human dig 




10 


547.4 


61 


3 


2804 


4 


AAK90423 


Aak90423 Human dig 




11 


367 


41 


1 


574 


6 


ABL65256 


Abl65256 Lung cane 




12 


183 


20 


5 


203 


7 


AFS82561 


Afs82561 Human tra 




13 


183 


20 


5 


1617 


9 


ADE07493 


Ade07493 Novel cod 




14 


124 


13 


9 


170 


12 


ACH91110 


Ach91110 Human gen 




15 


124 


13 


9 


550 


12 


ACH77410 


Ach77410 Human gen 




16 


105 . 4 


11 


8 


2721 


6 


ABS61407 


Abs61407 Prostate 




17 


92 . 8 


10 


4 


763 


6 


ABQ27175 


Abq27175 Oligonucl 


c 


18 


92 . 8 


10 


4 


763 


6 


ABQ27174 


Abq27174 Oligonucl 


c 


19 


69 . 4 


7 


8 


763 


6 


ABQ27177 


Abq27177 Oligonucl 




20 


69 . 4 


7 


8 


763 


6 


ABQ27176 


Abq27176 Oligonucl 




21 


52 


5 


8 


2000 


8 


ADA71938 


Ada71938 Rice gene 


c 


22 


52 


5 


8 


114955 


2 


AAX53491 


Aax53 491 Human ade 




23 


51.6 


5 


8 


114955 


2 


AAX53491 


Aax53 491 Human ade 




24 


48 . 8 


5 


5 


6276 


12 


ADP07263 


Adp0 72 63 Human COL 




25 


48 . 8 


5 


5 


6276 


13 


ADU06039 


Adu06039 Novel bro 
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26 


48 . 8 


5 . 


, 5 


6726 


13 


ADQ22548 


Adq2 25 4 8 Human sof 


c 


27 


48 


5 . 


. 4 


1635 


11 


ACN44187 


Acn4 418 7 Human mRN 


c 


28 


48 


5 . 


. 4 


22147 


11 


ACN44186 


Acn44186 Human gen 


c 


29 


48 


5 . 


. 4 


55211 


13 


ADQ97266_3 


Continuation (4 of 


c 


30 


47 . 8 


5 . 


. 4 


547 


14 


AGG40577 


Agg40577 Human sin 


c 


31 


47 . 8 


5 . 


. 4 


547 


14 


AGG40578 


Agg40578 Human sin 




32 


47 . 6 


5 . 


.3 


645 


14 


AFR13959 


Afrl3959 Recombina 




33 


47 . 4 


5 , 


.3 


615 


14 


AFR14548 


Afrl4548 Recombina 




34 


47 . 4 


5 . 


.3 


9767 


4 


AAL26727 


Aal26727 Human bre 




35 


47 . 4 


5 . 


,3 


9767 


4 


AAL26476 


Aal26476 Human bre 




36 


47 . 4 


5 , 


,3 


9767 


4 


AAL26682 


Aal26682 Human bre 




37 


47 . 4 


5 . 


.3 


9767 


4 


AAL26568 


Aal26568 Human bre 




38 


47 . 4 


5 , 


.3 


9767 


4 


AAL26438 


Aal26438 Human bre 




39 


47 . 4 


5 , 


,3 


9963 


10 


ACN88549 


Acn88549 Breast ca 




40 


47 . 2 


5 . 


.3 


768 


14 


AFR16600 


Afrl6600 Recombina 


c 


41 


47 


5 , 


,3 


2181 


6 


ABN95812 


Abn95812 Gene #231 


c 


42 


46 . 8 


5 . 


. 2 


564 


12 


ADJ43755 


Adj43755 Plant cDN 


c 


43 


46 . 8 


5 . 


. 2 


564 


12 


ADJ43765 


Adj43765 Plant cDN 


c 


44 


46 . 8 


5 , 


, 2 


56 7 


12 


ADJ43710 


Adj43710 Plant cDN 


c 


45 


46.8 


5. 


.2 


567 


12 


ADJ39391 


Adj39391 Plant cDN 



ALIGNMENTS 



RESULT 1 
ADM96969 

ID ADM96969 standard; cDNA; 893 BP. 

XX 

AC ADM96 96 9; 
XX 

DT 01-JUL-2004 (first entry) 
XX 

DE Human pancreatic cancer upregulated gene C1958V2. 
XX 

KW ds; gene; cytostatic; gene therapy; pancreatic cancer; diagnosis; 

KW anti-tumor immunity. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 
FT CDS 197. .259 

FT /*tag= a 

FT /product= "C1958V2 protein" 

XX 

PN WO2004031411-A2 . 
XX 

PD 15-APR-2004. 

XX 
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PF 12-SEP-2003; 2003WO-JP011713 . 
XX 

PR 30-SEP-2002; 2002US-0414872P . 

PR 28-FEB-2003; 2003US-0450889P . 
XX 

PA (0NC0-) ONCOTHERAPY SCI INC. 

PA (UYTY ) UNIV TOKYO. 

XX 

PI Nakamura Y, Katagiri T; 
XX 

DR WPI; 2004-330204/30. 

DR P-PSDB; ADM96970. 

XX 

PT New C1958V1 or C1958V2 polypeptides, useful in useful in diagnosing and 

PT treating pancreatic cancer and in inducing anti tumor immunity. 

XX 

PS Claim 2; SEQ ID NO 3; 71pp; English. 

XX 

CC The invention relates to the isolation of novel genes upregulated in 

CC pancreatic cancer designated C1958V1 and C1958V2, their encoded 

CC polypeptides (I), a seguence in which one or more amino acids are 

CC substituted, deleted, inserted, and/or added and that has a biological 

CC activity eguivalent to the C1958V1 or C1958V2 proteins; or a sequence 

CC encoded by a polynucleotide that hybridizes under stringent conditions to 

CC the C1958V1 or C1958V2 polynucleotides. The polypeptides and 

CC polynucleotides, compounds and compositions are useful in diagnosing and 

CC treating pancreatic cancer and in inducing anti tumor immunity. This 

CC sequence represents the C1958V2 cDNA sequence. 

XX 

SQ Sequence 893 BP; 175 A; 287 C; 274 G; 157 T; 0 U; 0 Other; 

Query Match 100.0%; Score 893; DB 12; Length 893; 

Best Local Similarity 100.0%; Pred. No. 2.7e-205; 

Matches 8 93; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 CCGCGGGAGGCGCGCGGCTGCCCGAGCGCCGGCCGGGCCATGACCCCCGCTGCTCTGTCT 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 CCGCGGGAGGCGCGCGGCTGCCCGAGCGCCGGCCGGGCCATGACCCCCGCTGCTCTGTCT 6 0 

Qy 61 TGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGACCGCCGCCGCCACCACCACCAGCGCCC 12 0 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 TGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGACCGCCGCCGCCACCACCACCAGCGCCC 12 0 

Qy 121 GGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGCAGTGAGCCCACCAAGAAGGAAGCGGC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 GGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGCAGTGAGCCCACCAAGAAGGAAGCGGC 180 

Qy 181 CTGCAGAGGTGCCGACATGGGGCTTAAGATGTCCTGCCTGAAAGCAGCAGCAGCAGCCAC 2 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 181 CTGCAGAGGTGCCGACATGGGGCTTAAGATGTCCTGCCTGAAAGCAGCAGCAGCAGCCAC 2 40 

Qy 241 GACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCC 300 

I I I I I I I I I I I I I I I I I I I I I I II I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCC 300 

Qy 301 CCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAG 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 CCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAG 36 0 

Qy 361 ACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGT 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 ACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGT 42 0 

Qy 421 TTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCT 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 TTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCT 480 

Qy 481 GGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAG 5 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 GGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAG 5 40 

Qy 541 AGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGG 6 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 AGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGG 6 00 

Qy 601 CCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGA 66 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 CCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGA 66 0 

Qy 661 AAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAAC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 AAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAAC 720 

Qy 721 ACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGA 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 ACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGA 78 0 

Qy 781 ACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCC 840 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 ACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCC 840 

Qy 841 AAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 8 93 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 AAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 8 93 

RESULT 2 
AES72473 
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ID AES72473 standard; cDNA; 893 BP. 

XX 

AC AES72473; 
XX 

DT 03-MAY-2007 (first entry) 
XX 

DE Human C1958 splice variant 3, cDNA. 
XX 

KW Pancreatic ductal adenocarcinoma; cancer; cytostatic; tumor marker; 

KW protein therapy; screening; splice variant; ss; gene; C1958V3; apoptosis; 

KW gene therapy; pancreas tumor; lung tumor; renal tumor; testicle tumor. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT CDS 197. .259 

FT /*tag= a 

FT /product= "C1958V3" 

XX 

PN WO2007013358-A2 . 
XX 

PD 01-FEB-2007. 

XX 

PF 14-JUL-2006; 2006WO-JP314442 . 
XX 

PR 28-JUL-2005; 2005US-0703791P . 
XX 

PA (0NC0-) ONCOTHERAPY SCI INC. 

PA (UYTY ) UNIV TOKYO. 

XX 

PI Nakamura Y, Katagiri T, Inaki K; 
XX 

DR WPI; 2007-283242/27. 

DR P-PSDB; AES72474. 

XX 

PT New VIVIT polypeptide useful for treating or preventing cancer, such as 

PT pancreatic cancer, lung cancer, kidney cancer and testicular tumor. 
XX 

PS Disclosure; SEQ ID NO 3; 78pp; English. 

XX 

CC The invention relates to a VIVIT polypeptide (AES72497) and at least a 

CC fragment of the human C1958 sequence appearing as AES72472, in which 

CC residues at positions 37-41 is replaced with AES72497, or an amino acid 

CC sequence of a polypeptide functionally equivalent to the polypeptide 

CC comprising the fragment sequence. Also included are an agent for 

CC treating/preventing cancer (comprising as an active ingredient the VIVIT 

CC polypeptide, or a polynucleotide encoding the polypeptide) , a 

CC pharmaceutical composition (comprising the VIVIT polypeptide, and a 

CC carrier), screening (Ml) for a compound useful in treating/preventing 
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CC cancers (involving (a) contacting a polypeptide comprising a PPP3CA- 

CC binding domain of a C1958 polypeptide with a polypeptide comprising a 

CC C1958-binding domain of a PPP3CA polypeptide in the presence of a test 

CC compound, (b) detecting binding between the polypeptides, and (c) 

CC selecting a test compound that inhibits binding between the 

CC polypeptides), a kit for screening for a compound for treating or 

CC preventing cancers, treating/preventing cancers in a subject (involving 

CC administering the compound selected above) , and inducing apoptosis in a 

CC cell (involving introducing a polypeptide having a dominant-negative 

CC effect against C1958 or a polynucleotide encoding the polypeptide into 

CC the cell, where the polypeptide comprises a fragment sequence having 

CC AES72497, or the mutated C1958 above) . The polypeptide is modified with a 

CC cell-membrane permeable substance, which has the general formula [R]-[D], 

CC where [R] represents the cell-membrane permeable substance, and [D] 

CC represents the amino acid sequence of a fragment sequence of AES72497, or 

CC the mutated C1958 peptide. The VIVIT polypeptide is useful for treating 

CC and/or preventing cancer, preferably pancreatic cancer (especially 

CC Pancreatic ductal adenocarcinoma) , lung cancer, kidney cancer and 

CC testicular tumor. (Ml) is useful for screening a compound for treating or 

CC preventing cancers. The present sequence encodes a splice variant of 

CC human C1958. 

XX 

SQ Sequence 893 BP; 175 A; 287 C; 274 G; 157 T; 0 U; 0 Other; 

Query Match 100.0%; Score 893; DB 22; Length 893; 

Best Local Similarity 100.0%; Pred. No. 2.7e-205; 

Matches 8 93; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 CCGCGGGAGGCGCGCGGCTGCCCGAGCGCCGGCCGGGCCATGACCCCCGCTGCTCTGTCT 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 CCGCGGGAGGCGCGCGGCTGCCCGAGCGCCGGCCGGGCCATGACCCCCGCTGCTCTGTCT 6 0 

Qy 61 TGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGACCGCCGCCGCCACCACCACCAGCGCCC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 TGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGACCGCCGCCGCCACCACCACCAGCGCCC 12 0 

Qy 121 GGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGCAGTGAGCCCACCAAGAAGGAAGCGGC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 GGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGCAGTGAGCCCACCAAGAAGGAAGCGGC 180 

Qy 181 CTGCAGAGGTGCCGACATGGGGCTTAAGATGTCCTGCCTGAAAGCAGCAGCAGCAGCCAC 2 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 CTGCAGAGGTGCCGACATGGGGCTTAAGATGTCCTGCCTGAAAGCAGCAGCAGCAGCCAC 2 40 

Qy 241 GACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCC 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCC 300 

Qy 301 CCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAG 36 0 
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I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 CCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAG 36 0 

Qy 361 ACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGT 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 ACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGT 42 0 

Qy 421 TTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCT 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 TTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCT 480 

Qy 481 GGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAG 5 40 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 GGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAG 5 40 

Qy 541 AGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGG 6 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 AGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGG 6 00 

Qy 601 CCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGA 66 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 CCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGA 66 0 

Qy 661 AAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAAC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 AAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAAC 72 0 

Qy 721 ACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGA 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 ACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGA 78 0 

Qy 781 ACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCC 840 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 ACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCC 840 

Qy 841 AAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 8 93 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 AAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 893 



RESULT 3 
ADM96 96 7 

ID ADM96967 standard; cDNA; 881 BP. 

XX 

AC ADM96 96 7; 
XX 

DT 01-JUL-2004 (first entry) 
XX 

DE Human pancreatic cancer upregulated gene C1958V1. 
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SCORE Search Results Details for Application U 1529592 ind Search Result 20080410_005435_us-10-529-592a- 3 .rng. 
XX 

KW ds; gene; cytostatic; gene therapy; pancreatic cancer; diagnosis; 

KW anti-tumor immunity. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT CDS 163. .393 

FT /*tag= a 

FT /product= "C1958V1 protein" 

XX 

PN WO2004031411-A2 . 
XX 

PD 15-APR-2004. 

XX 

PF 12-SEP-2003; 2003WO-JP011713 . 
XX 

PR 30-SEP-2002; 2002US-0414872P . 

PR 28-FEB-2003; 2003US-0450889P . 
XX 

PA (ONCO-) ONCOTHERAPY SCI INC. 

PA (UYTY ) UNIV TOKYO. 

XX 

PI Nakaraura Y, Katagiri T; 
XX 

DR WPI; 2004-330204/30. 

DR P-PSDB; ADM96968. 
XX 

PT New C1958V1 or C1958V2 polypeptides, useful in useful in diagnosing and 

PT treating pancreatic cancer and in inducing anti tumor immunity. 

XX 

PS Claim 2; SEQ ID NO 1; 71pp; English. 

XX 

CC The invention relates to the isolation of novel genes upregulated in 

CC pancreatic cancer designated C1958V1 and C1958V2, their encoded 

CC polypeptides (I), a seguence in which one or more amino acids are 

CC substituted, deleted, inserted, and/or added and that has a biological 

CC activity eguivalent to the C1958V1 or C1958V2 proteins; or a seguence 

CC encoded by a polynucleotide that hybridizes under stringent conditions to 

CC the C1958V1 or C1958V2 polynucleotides. The polypeptides and 

CC polynucleotides, compounds and compositions are useful in diagnosing and 

CC treating pancreatic cancer and in inducing anti tumor immunity. This 

CC seguence represents the C1958V1 cDNA sequence. 

XX 

SQ Sequence 881 BP; 178 A; 276 C; 264 G; 163 T; 0 U; 0 Other; 

Query Match 92.6%; Score 827; DB 12; Length 881; 
Best Local Similarity 97.5%; Pred. No. 2.4e-189; 

Matches 859; Conservative 0; Mismatches 0; Indels 22; Gaps 1; 
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Qy 35 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 94 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 

Qy 95 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 15 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 12 0 

Qy 15 5 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 214 

I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 180 

Qy 215 TGCCTGAA AGCAGCAGCAGCAGCCACGACGAGGCCCCC 252 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 40 

Qy 253 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 312 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 300 

Qy 313 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 3 72 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 

Qy 3 73 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 432 

I I I I I I I I I I I I I I I I I I I I I I I I I! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 

Qy 433 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 492 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 480 

Qy 493 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 552 

I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 5 40 

Qy 553 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 612 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 6 00 

Qy 613 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 6 72 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 66 0 

Qy 6 73 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 732 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 720 
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Qy 733 CCCCCAGGGCTGTGC7AAACACATGCCCCTGCCAT7AAGCACC7AAC7AAG7AACTTCTTGCAGG 792 

I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 78 0 

Qy 793 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 852 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 8 40 

Qy 853 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 8 93 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881 



RESULT 4 
AES72471 

ID AES72471 standard; cDNA; 881 BP. 

XX 

AC AES72471; 
XX 

DT 03-MAY-2007 (first entry) 
XX 

DE Human C1958 splice variant 1, cDNA. 
XX 

KW Pancreatic ductal adenocarcinoma; cancer; cytostatic; tumor marker; 

KW protein therapy; screening; splice variant; ss; gene; C1958V1; apoptosis; 

KW gene therapy; pancreas tumor; lung tumor; renal tumor; testicle tumor. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT CDS 163. .393 

FT /*tag= a 

FT /product= "C1958V1 protein" 
XX 

PN WO2007013358-A2 . 
XX 

PD 01-FEB-2007. 

XX 

PF 14-JUL-2006; 2006WO-JP314442 . 
XX 

PR 28-JUL-2005; 2 0 05US-0 703 79 IP . 
XX 

PA (ONCO-) ONCOTHERAPY SCI INC. 

PA (UYTY ) UNIV TOKYO. 

XX 

PI Nakamura Y, Katagiri T, Inaki K; 
XX 

DR WPI; 2007-283242/27. 

DR P-PSDB; AES72472. 
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XX 

PT New VIVIT polypeptide useful for treating or preventing cancer, such as 

PT pancreatic cancer, lung cancer, kidney cancer and testicular tumor. 
XX 

PS Disclosure; SEQ ID NO 1; 78pp; English. 

XX 

CC The invention relates to a VIVIT polypeptide (AES72497) and at least a 

CC fragment of the human C1958 sequence appearing as AES72472, in which 

CC residues at positions 37-41 is replaced with AES72497, or an amino acid 

CC sequence of a polypeptide functionally equivalent to the polypeptide 

CC comprising the fragment sequence. Also included are an agent for 

CC treating/preventing cancer (comprising as an active ingredient the VIVIT 

CC polypeptide, or a polynucleotide encoding the polypeptide) , a 

CC pharmaceutical composition (comprising the VIVIT polypeptide, and a 

CC carrier), screening (Ml) for a compound useful in treating/preventing 

CC cancers (involving (a) contacting a polypeptide comprising a PPP3CA- 

CC binding domain of a C1958 polypeptide with a polypeptide comprising a 

CC C1958-binding domain of a PPP3CA polypeptide in the presence of a test 

CC compound, (b) detecting binding between the polypeptides, and (c) 

CC selecting a test compound that inhibits binding between the 

CC polypeptides), a kit for screening for a compound for treating or 

CC preventing cancers, treating/preventing cancers in a subject (involving 

CC administering the compound selected above) , and inducing apoptosis in a 

CC cell (involving introducing a polypeptide having a dominant-negative 

CC effect against C1958 or a polynucleotide encoding the polypeptide into 

CC the cell, where the polypeptide comprises a fragment sequence having 

CC AES72497, or the mutated C1958 above) . The polypeptide is modified with a 

CC cell-membrane permeable substance, which has the general formula [R]-[D], 

CC where [R] represents the cell-membrane permeable substance, and [D] 

CC represents the amino acid sequence of a fragment sequence of AES72497, or 

CC the mutated C1958 peptide. The VIVIT polypeptide is useful for treating 

CC and/or preventing cancer, preferably pancreatic cancer (especially 

CC Pancreatic ductal adenocarcinoma) , lung cancer, kidney cancer and 

CC testicular tumor. (Ml) is useful for screening a compound for treating or 

CC preventing cancers. The present sequence encodes a splice variant of 

CC human C1958. 

XX 

SQ Sequence 881 BP; 178 A; 276 C; 264 G; 163 T; 0 U; 0 Other; 

Query Match 92.6%; Score 827; DB 22; Length 881; 

Best Local Similarity 97.5%; Pred. No. 2.4e-189; 

Matches 859; Conservative 0; Mismatches 0; Indels 22; Gaps 1; 

Qy 35 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 9 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 GGGCCATGACCCCCGCTGCTCTGTCTTGCAGGCTCGTCGCCGCGGCCCCCCGAGCCCGAC 6 0 

Qy 95 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 154 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 61 CGCCGCCGCCACCACCACCAGCGCCCGGGCGGGCCTCGCGCGCCTCGGGCGCGGCTCCGC 120 



Qy 15 5 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 214 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 AGTGAGCCCACCAAGAAGGAAGCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCC 180 

Qy 215 TGCCTGAA AGCAGCAGCAGCAGCCACGACGAGGCCCCC 252 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 TGCCTGAAAGGCTTTCAAATGTGTGTCAGCAGCAGCAGCAGCAGCCACGACGAGGCCCCC 2 40 

Qy 253 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 312 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GTCCTGAACGACAAGCACCTGGACGTGCCCGACATCATCATCACGCCCCCCACCCCCACG 300 

Qy 313 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 3 72 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GGCATGATGCTGCCGAGGGACTTGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGC 36 0 

Qy 3 73 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 432 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 CCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGG 42 0 

Qy 433 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 492 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 CTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTG 48 0 

Qy 493 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 552 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 CTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGG 5 40 

Qy 553 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 612 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 ACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTG 6 00 

Qy 613 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 6 72 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 GACCGGACCTCTGACACCTCCAGGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCA 66 0 

Qy 6 73 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 732 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 TGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAG 720 

Qy 733 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 792 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 CCCCCAGGGCTGTGCAAACACATGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGG 78 0 

Qy 793 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 852 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 TGGAGTGGCTGTTTTTTATAAGTTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTT 8 40 
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Qy 853 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 8 93 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 ATAATTTCTTTTTTGCATTATAAATAAAGATCCTCTGTAAC 881 



RESULT 5 
AAI90742/C 

ID AAI90742 standard; cDNA; 963 BP. 

XX 

AC AAI90742; 
XX 

DT 06-NOV-2001 (first entry) 
XX 

DE Human polynucleotide SEQ ID NO 10802. 
XX 

KW Human; cytokine; cell proliferation; cell differentiation; gene therapy; 

KW vaccine; peptide therapy; stem cell growth factor; haematopoiesis ; 

KW tissue growth factor; immunomodulatory; cancer; leukaemia; 

KW nervous system disorders; arthritis; inflammation; ss. 

XX 

OS Homo sapiens . 
XX 

PN WO200164835-A2 . 
XX 

PD 07-SEP-2001. 
XX 

PF 26-FEB-2001; 2 0 0 1WO-US0 0 49 2 7 . 
XX 

PR 28-FEB-2000; 2000US-00515126 . 

PR 18-MAY-2000; 2 0 0 OUS-0 05 7 74 0 9 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Drmanac RT; 
XX 

DR WPI; 2001-514838/56. 

DR P-PSDB; AAO10811. 

XX 

PT Isolated nucleic acids and polypeptides, useful for preventing diagnosing 

PT and treating e.g. leukemia, inflammation and immune disorders. 

XX 

PS Claim 1; SEQ ID NO 10802; 1399pp + Sequence Listing; English. 
XX 

CC The invention relates to human polynucleotides (AAI79941-AAI93841 ) and 

CC the encoded proteins (AAO00010-AAO13910 ) that exhibit activity elating to 

CC cytokine, cell proliferation or cell differentiation or which may induce 

CC production of other cytokines in other cell populations. The 

CC polynucleotides and polypeptides are useful in gene therapy, vaccines or 
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CC peptide therapy. The polypeptides have various cytokine-like activities, 

CC e.g. stem cell growth factor activity, haematopoiesis regulating 

CC activity, tissue growth factor activity, immunomodulatory activity and 

CC activin/inhibin activity and may be useful in the diagnosis and/or 

CC treatment of cancer, leukaemia, nervous system disorders, arthritis and 

CC inflammation. Note: The sequence data for this patent did not form part 

CC of the printed specification, but was obtained in electronic format 

CC directly from WIPO at ftp.wipo.int/pub/published_pct_sequences 

XX 

SQ Sequence 963 BP; 202 A; 296 C; 269 G; 194 T; 0 U; 2 Other; 

Query Match 75.1%; Score 6 71; DB 4; Length 963; 

Best Local Similarity 100.0%; Pred. No. l.le-151; 

Matches 6 71; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 223 AGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCC 2 82 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 6 79 AGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCC 620 

Qy 2 83 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 3 42 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 619 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 56 0 

Qy 343 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 402 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 55 9 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 500 

Qy 403 TGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGT 462 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 499 TGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGT 440 

Qy 46 3 CCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAAC 522 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 43 9 CCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAAC 380 

Qy 523 CCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCC 5 82 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3 79 CCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCC 32 0 

Qy 583 AGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTG 6 42 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 319 AGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTG 26 0 

Qy 6 43 CTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAG 702 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 25 9 CTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAG 200 

Qy 703 GCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTG 76 2 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



http://es/ScoreAccessWeh/GetIt( m acti. n ' Vppld=1052959 10_005435_us-10-529-592a-3.rng&ItemType=4&startByte=0 (15 of 44)6/7/2008 9:10:51 PM 



SCORE Search Results Details for Application 10529592 and Search Result 20080410_005435_us-10-529-592a-3.rng. 

Db 19 9 GCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTG 140 



QY 



763 CCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTA 822 




Db 



13 9 CCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTA 8 0 



Qy 



823 CAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGA 8 82 




Db 



7 9 CAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGA 2 0 



Qy 



883 TCCTCTGTAAC 893 



Db 



19 TCCTCTGTAAC 9 



RESULT 6 
ADE09694/c 

ID ADE09694 standard; DNA; 963 BP. 
XX 

AC ADE0 96 9 4; 
XX 

DT 29-JAN-2004 (first entry) 
XX 

DE Novel DNA-related contig nucleotide sequence #416. 
XX 

KW novel gene; novel protein; tissue marker; molecular weight marker; 

KW chromosome marker; genetic disorder; contig; ds . 

XX 

OS Unidentified. 

XX 

PN WO2003054152-A2 . 
XX 

PD 03-JUL-2003. 

XX 

PF 10-DEC-2002; 2002WO-US039555 . 
XX 

PR 10-DEC-2001; 2 0 0 1US-0 33 9 73 9P . 

PR ll-DEC-2001; 2001US-0339453P . 

PR 14-MAR-2002; 2002US-0365091P . 

PR 14-MAR-2002; 2002US-0365384P . 

PR 12-APR-2002; 2002US-0372381P . 

PR 12-APR-2002; 2002US-0372615P . 

PR 22-APR-2002; 2002US-00128558 . 

PR 24-APR-2002; 2 0 02US-03 76 0 45P . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Asundi V, Goodrich RW, Ren F, Zhang J, Zhao QA, Wang J; 

PI Ghosh M, Xue AJ, Wehrman T, Weng G, Zhou P, Drmanac RT, Wang Z; 



http://es/ScoreAccessWeb/GetItem.action?AppId=1052959.. .10_005435_us-10-529-592a-3.rng&ItemT\ pe=4&startB\ te=0 ( 16 of 44)6/7/2008 9:10:5 1 PM 



SCORE Search Results Details for Application 105295 12 ind Search Result 20080410_005435_us-10-529-592a-3.rng. 



PI Ma Y, Wang D, Chen R, Xu C, Boyle BJ; 

XX 

DR WPI; 2003-569235/53. 
XX 

PT New polynucleotides, useful for expressing recombinant proteins for 

PT analysis, characterization or therapeutic use, or as markers for tissues 

PT in which the corresponding protein is preferentially expressed. 

XX 

PS Disclosure; SEQ ID NO 2238; 1177pp; English. 

XX 

CC The invention comprises the amino acid and coding seguences of novel 

CC proteins. The DNA and protein seguences of the invention are useful as: 

CC markers for tissues in which the corresponding protein is preferentially 

CC expressed; as molecular weight markers on gels; as chromosome markers or 

CC tags; to identify chromosomes or to map related gene positions; and to 

CC compare with endogenous DNA seguences in patients to identify potential 

CC genetic disorders. The present DNA seguence was used in the 

CC exemplification of the invention. 
XX 

SQ Seguence 963 BP; 202 A; 296 C; 269 G; 194 T; 0 U; 2 Other; 



Query Match 75.1%; Score 6 71; DB 9; Length 963; 

Best Local Similarity 100.0%; Pred. No. l.le-151; 

Matches 671; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 223 AGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCC 2 82 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 6 79 AGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCC 620 



Qy 283 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 3 42 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 619 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 56 0 



Qy 343 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 402 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 55 9 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 500 



Qy 403 TGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGT 46 2 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 499 TGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGT 440 



Qy 463 CCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAAC 522 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 43 9 CCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAAC 380 



Qy 523 CCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCC 5 82 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3 79 CCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCC 32 0 
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Qy 583 AGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTG 6 42 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 319 AGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTG 26 0 

Qy 6 43 CTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAG 702 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 25 9 CTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAG 200 

Qy 703 GCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTG 76 2 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 19 9 GCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTG 140 

Qy 763 CCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTA 822 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 13 9 CCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTA 8 0 

Qy 823 CAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGA 8 82 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 7 9 CAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGA 2 0 

Qy 883 TCCTCTGTAAC 893 

I I I I I I I I I I I 
Db 19 TCCTCTGTAAC 9 



RESULT 7 
AAK88446 

ID AAK8 8 446 standard; cDNA; 761 BP. 

XX 

AC AAK8 8 446; 
XX 

DT 05-NOV-2001 (first entry) 
XX 

DE Human digestive system antigen coding sequence SEQ ID NO: 762. 
XX 

KW Human; digestive system antigen; gene therapy; cancer; appendicitis; 

KW ulcerative colitis; infection; Hirschsprung's disease; chronic colitis; 

KW digestive system disorder; Meckel's diverticulum; ss . 
XX 

OS Homo sapiens . 
XX 

PN WO200155314-A2 . 
XX 

PD 02-AUG-2001. 

XX 

PF 17-JAN-2001; 2001WO-US001324 . 
XX 

PR 31-JAN-2000; 2000US-0179065P . 

PR 04-FEB-2000; 2000US-0180628P . 
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2000US-0184664P. 
2000US-0186350P. 
2000US-0189874P. 
2000US-0190076P. 
2000US-0198123P . 
2000US-0205515P. 
2000US-0209467P. 
2000US-0214886P. 
2000US-0215135P. 
2000US-0216647P. 
2000US-0216880P. 
2000US-0217487P. 
2000US-0217496P. 
2000US-0218290P . 
2000US-0220963P. 
2000US-0220964P. 
2000US-0224518P. 
2000US-0224519P. 
2000US-0225213P. 
2000US-0225214P. 
2000US-0225266P. 
2000US-0225267P. 
2000US-0225268P. 
2000US-0225270P. 
2000US-0225447P. 
2000US-0225757P. 
2000US-0225758P. 
2000US-0225759P. 
2000US-0226279P. 
2000US-0226681P. 
2000US-0226868P. 
2000US-0227182P . 
2000US-0227009P. 
2000US-0228924P. 
2000US-0229287P. 
2000US-0229343P. 
2000US-0229344P. 
2000US-0229345P. 
2000US-0229509P. 
2000US-0229513P. 
2000US-0230437P. 
2000US-0230438P. 
2000US-0231242P. 
2000US-0231243P. 
2000US-0231244P. 
2000US-0231413P. 
2000US-0231414P. 
2000US-0232080P. 
2000US-0232081P. 
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2000US-0231968P. 
2000US-0232397P. 
2000US-0232398P. 
2000US-0232399P. 
2000US-0232400P. 
2000US-0232401P. 
2000US-0233063P. 
2000US-0233064P. 
2000US-0233065P. 
2000US-0234223P. 
2000US-0234274P. 
2000US-0234997P. 
2000US-0234998P. 
2000US-0235484P. 
2000US-0235834P. 
2000US-0235836P. 
2000US-0236327P. 
2000US-0236367P. 
2000US-0236368P. 
2000US-0236369P. 
2000US-0236370P. 
2000US-0236802P. 
2000US-0237037P. 
2000US-0237038P. 
2000US-0237039P. 
2000US-0237040P. 
2000US-0239935P. 
2000US-0239937P. 
2000US-0240960P. 
2000US-0241221P. 
2000US-0241785P. 
2000US-0241786P. 
2000US-0241787P. 
2000US-0241808P. 
2000US-0241809P. 
2000US-0241826P. 
2000US-0244617P. 
2000US-0246474P. 
2000US-0246475P. 
2000US-0246476P. 
2000US-0246477P. 
2000US-0246478P. 
2000US-0246523P. 
2000US-0246524P. 
2000US-0246525P. 
2000US-0246526P. 
2000US-0246527P. 
2000US-0246528P. 
2000US-0246532P. 
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XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Barash SC, Ruben SM; 
XX 

DR WPI; 2001-502630/55. 

DR P-PSDB; AAM926 73. 
XX 

PT Polynucleotides encoding digestive system antigens, useful for 

PT diagnosing, treating, preventing and/or prognozing disorders of the 

PT digestive system, particularly cancer and cancer metastases . 

XX 

PS Claim 1; SEQ ID NO 762; 986pp; English. 
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CC The present invention provides the protein and coding sequences of a 

CC number of human digestive system antigens. These can be used in the 

CC diagnosis, treatment and prevention of digestive system disorders, 

CC including cancer, Meckel's diverticulum, bacterial or parasitic 

CC infections, appendicitis, Hirschsprung's disease, chronic colitis or 

CC ulcerative colitis. The present sequence is a cDNA encoding a digestive 

CC system antigen of the invention 

XX 

SQ Sequence 761 BP; 183 A; 208 C; 218 G; 150 T; 0 U; 2 Other; 

Query Match 74.2%; Score 662.2; DB 4; Length 761; 

Best Local Similarity 96.6%; Pred. No. 1.4e-149; 

Matches 714; Conservative 2; Mismatches 0; Indels 23; Gaps 3; 

Qy 176 GCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCCTGCCTGAA 222 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 GCGGCCTGCAGAGGTGCCGACATGGGGCTTAAGATGTCCTGCCTGAAAGCTTTCAAATGT 6 0 

Qy 223 AGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCACCTGG 2 74 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I I I I I I I 
Db 61 GTGTCAGCAGCAGCAGCAGCAGCCACGACGAGG-CCCCGTCCTGAAMGWCAAGCACCTGG 119 

Qy 2 75 ACGTGCCCGACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACT 33 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 12 0 ACGTGCCCGACATCATCATCACGCCCCCCA-CCCCACGGGCATGATGCTGCCGAGGGACT 178 

Qy 335 TGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACC 394 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 179 TGGGGAGCACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACC 23 8 

Qy 395 CAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCT 454 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 23 9 CAGAAGCCTGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCT 298 

Qy 45 5 TCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACT 514 

I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 29 9 TCAGGTGTCCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACT 35 8 

Qy 515 GAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGT 5 74 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 35 9 GAGCGAACCCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGT 418 

Qy 575 CCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCA 634 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 419 CCACTTCCAGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCA 478 

Qy 635 GGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCC 694 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 479 GGTTCTTGCTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCC 538 



QY 



6 95 AGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACA 75 4 




Db 



53 9 AGGGAGAGGCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACA 5 9 8 



Qy 



75 5 TGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAG 814 




Db 



5 9 9 TGCCCCTGCCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAG 6 5 8 



Qy 



815 TTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATA 8 74 




Db 



65 9 TTGTTTTACAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATA 718 



Qy 



8 75 AATAAAGATCCTCTGTAAC 8 93 




Db 



719 AATAAAGATCCTCTGTAAC 73 7 



RESULT 8 
ADR26670/C 

ID ADR26670 standard; DNA; 614 BP. 
XX 

AC ADR26670; 
XX 

DT 21-OCT-2004 (first entry) 
XX 

DE Breast cancer prognosis marker #2531. 
XX 

KW ds; breast cancer; prognosis; gene expression; diagnosis. 
XX 

OS Homo sapiens . 
XX 

PN WO2004065545-A2 . 
XX 

PD 05-AUG-2004. 

XX 

PF 15-JAN-2004; 2004WO-US001100 . 
XX 

PR 15-JAN-2003; 2003US-00342887 . 
XX 

PA (ROSE-) ROSETTA INPHARMATICS LLC. 

PA (NECA-) NETHERLANDS CANCER INST. 
XX 

PI Van't Veer LJ, He Y; 
XX 

DR WPI; 2004-593473/57. 
XX 

PT Classifying a breast cancer patient according to prognosis comprises 
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PT determining the similarity between the level of expression of each of 

PT five genes in a cell sample taken from patient, to control levels. 
XX 

PS Disclosure; SEQ ID NO 2531; 226pp; English. 

XX 

CC The invention relates to a method of classifying a breast cancer patient 

CC according to prognosis by determining the similarity between the level of 

CC expression of each of five genes for which markers are listed in the 

CC specification, in a cell sample taken from the breast cancer patient, to 

CC control levels of expression for each respective five genes to obtain a 

CC patient similarity value. The methods are useful for classifying a breast 

CC cancer patient according to prognosis. Kits and computer program products 

CC are useful for data analysis using the diagnostic, prognostic and 

CC statistical methods of the invention. This sequence corresponds to a 

CC marker used in the method of the invention. 

XX 

SQ Sequence 614 BP; 130 A; 179 C; 171 G; 134 T; 0 U; 0 Other; 



Query Match 6 8.4%; Score 611; DB 13; Length 614; 

Best Local Similarity 100.0%; Pred. No. 3.1e-137; 

Matches 611; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 283 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 3 42 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 614 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 555 



Qy 343 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 402 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 55 4 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 495 



Qy 403 TGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGT 46 2 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 494 TGAGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGT 435 



Qy 463 CCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAAC 522 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 43 4 CCGGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAAC 3 75 



Qy 523 CCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCC 5 82 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3 74 CCGGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCC 315 



Qy 583 AGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTG 6 42 

I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 314 AGAACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTG 255 



Qy 6 43 CTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAG 702 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 254 CTGACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAG 195 
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Qy 703 GCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTG 76 2 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 194 GCTCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTG 135 

Qy 763 CCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTA 822 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 13 4 CCATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTA 75 

Qy 823 CAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGA 8 82 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 7 4 CAGATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGA 15 

Qy 883 TCCTCTGTAAC 893 

I I I I I I I I I I I 
Db 14 TCCTCTGTAAC 4 



RESULT 9 
AAK90422 

ID AAK90422 standard; DNA; 2798 BP. 
XX 

AC AAK9 0 42 2; 
XX 

DT 05-NOV-2001 (first entry) 
XX 

DE Human digestive system antigen genomic sequence SEQ ID NO: 3998. 
XX 

KW Human; digestive system antigen; gene therapy; cancer; appendicitis; 

KW ulcerative colitis; infection; Hirschsprung's disease; chronic colitis; 

KW digestive system disorder; Meckel's diverticulum; ds . 
XX 

OS Homo sapiens . 
XX 

PN WO200155314-A2 . 
XX 

PD 02-AUG-2001. 

XX 

PF 17-JAN-2001; 2001WO-US001324 . 
XX 

PR 31-JAN-2000; 2000US-0179065P . 

PR 04-FEB-2000; 2000US-0180628P . 

PR 24-FEB-2000; 2 0 0 OUS-0 1 8 46 6 4P . 

PR 02-MAR-2000; 2000US-0186350P . 

PR 16-MAR-2000; 2000US-0189874P . 

PR 17-MAR-2000; 2000US-0190076P . 

PR 18-APR-2000; 2000US-0198123P . 

PR 19-MAY-2000; 2000US-0205515P . 

PR 07-JUN-2000; 2000US-020946 7P . 
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2000US-0214886P. 
2000US-0215135P. 
2000US-0216647P. 
2000US-0216880P. 
2000US-0217487P . 
2000US-0217496P. 
2000US-0218290P. 
2000US-0220963P. 
2000US-0220964P. 
2000US-0224518P. 
2000US-0224519P. 
2000US-0225213P. 
2000US-0225214P. 
2000US-0225266P. 
2000US-0225267P. 
2000US-0225268P. 
2000US-0225270P. 
2000US-0225447P. 
2000US-0225757P. 
2000US-0225758P. 
2000US-0225759P. 
2000US-0226279P. 
2000US-0226681P. 
2000US-0226868P. 
2000US-0227182P. 
2000US-0227009P. 
2000US-0228924P. 
2000US-0229287P. 
2000US-0229343P. 
2000US-0229344P. 
2000US-0229345P. 
2000US-0229509P. 
2000US-0229513P. 
2000US-0230437P. 
2000US-0230438P. 
2000US-0231242P. 
2000US-0231243P. 
2000US-0231244P. 
2000US-0231413P. 
2000US-0231414P. 
2000US-0232080P. 
2000US-0232081P. 
2000US-0231968P. 
2000US-0232397P. 
2000US-0232398P. 
2000US-0232399P. 
2000US-0232400P. 
2000US-0232401P. 
2000US-0233063P. 
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2000US-0233064P. 
2000US-0233065P. 
2000US-0234223P. 
2000US-0234274P. 
2000US-0234997P. 
2000US-0234998P. 
2000US-0235484P. 
2000US-0235834P. 
2000US-0235836P. 
2000US-0236327P. 
2000US-0236367P. 
2000US-0236368P. 
2000US-0236369P. 
2000US-0236370P. 
2000US-0236802P. 
2000US-0237037P. 
2000US-0237038P. 
2000US-0237039P. 
2000US-0237040P. 
2000US-0239935P. 
2000US-0239937P. 
2000US-0240960P. 
2000US-0241221P. 
2000US-0241785P. 
2000US-0241786P. 
2000US-0241787P. 
2000US-0241808P. 
2000US-0241809P. 
2000US-0241826P. 
2000US-0244617P. 
2000US-0246474P. 
2000US-0246475P. 
2000US-0246476P. 
2000US-0246477P. 
2000US-0246478P. 
2000US-0246523P. 
2000US-0246524P. 
2000US-0246525P. 
2000US-0246526P. 
2000US-0246527P. 
2000US-0246528P. 
2000US-0246532P. 
2000US-0246609P. 
2000US-0246610P. 
2000US-0246611P. 
2000US-0246613P. 
2000US-0249207P. 
2000US-0249208P. 
2000US-0249209P. 
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PR 17-NOV-2000; 2000US-0249210P . 

PR 17-NOV-2000; 2000US-0249211P . 

PR 17-NOV-2000; 2000US-0249212P . 

PR 17-NOV-2000; 2000US-0249213P . 

PR 17-NOV-2000; 2000US-0249214P . 

PR 17-NOV-2000; 2000US-0249215P . 

PR 17-NOV-2000; 2000US-0249216P . 

PR 17-NOV-2000; 2 0 0 OUS-0 2 4 9 2 1 7P . 

PR 17-NOV-2000; 2000US-0249218P . 

PR 17-NOV-2000; 2000US-0249244P . 

PR 17-NOV-2000; 2000US-0249245P . 

PR 17-NOV-2000; 2 0 0 OUS-0 2 4 9 26 4P . 

PR 17-NOV-2000; 2000US-0249265P . 

PR 17-NOV-2000; 2 0 0 OUS-0 2 4 9 2 9 7P . 

PR 17-NOV-2000; 2000US-0249299P . 

PR 17-NOV-2000; 2000US-0249300P . 

PR 01-DEC-2000; 2000US-0250160P . 

PR 01-DEC-2000; 2000US-0250391P . 

PR 05-DEC-2000; 2000US-0251030P . 

PR 05-DEC-2000; 2000US-0251988P . 

PR 05-DEC-2000; 2000US-0256 719P . 

PR 06-DEC-2000; 2000US-0251479P . 

PR 08-DEC-2000; 2000US-0251856P . 

PR 08-DEC-2000; 2000US-0251868P . 

PR 08-DEC-2000; 2000US-0251869P . 

PR 08-DEC-2000; 2000US-0251989P . 

PR 08-DEC-2000; 2000US-0251990P . 

PR ll-DEC-2000; 2000US-0254097P . 

PR 05-JAN-2001; 2001US-0259678P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Barash SC, Ruben SM; 
XX 

DR WPI; 2001-502630/55. 

XX 

PT Polynucleotides encoding digestive system antigens, useful for 

PT diagnosing, treating, preventing and/or prognozing disorders of the 

PT digestive system, particularly cancer and cancer metastases . 

XX 

PS Disclosure; SEQ ID NO 3998; 986pp; English. 

XX 

CC The present invention provides the protein and coding sequences of a 

CC number of human digestive system antigens. These can be used in the 

CC diagnosis, treatment and prevention of digestive system disorders, 

CC including cancer, Meckel's diverticulum, bacterial or parasitic 

CC infections, appendicitis, Hirschsprung's disease, chronic colitis or 

CC ulcerative colitis. The present sequence is a genomic DNA fragment 

CC encoding a digestive system antigen of the invention 
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XX 

SQ Sequence 2798 BP; 594 A; 775 C; 809 G; 620 T; 0 U; 0 Other; 

Query Match 61.5%; Score 549; DB 4; Length 2798; 

Best Local Similarity 100.0%; Pred. No. 4.7e-122; 

Matches 549; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 345 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 40 4 

I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2240 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 22 99 



Qy 405 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 46 4 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2300 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 2359 



Qy 46 5 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 52 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 236 0 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 2 419 



Qy 52 5 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 584 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 42 0 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 2 4 79 



Qy 585 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 6 44 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2480 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 2539 



Qy 6 45 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 70 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2540 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 25 99 



Qy 705 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 76 4 

I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 26 0 0 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 26 59 



Qy 76 5 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 82 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 266 0 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 2 719 



Qy 82 5 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 884 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 72 0 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 2 7 79 



Qy 885 CTCTGTAAC 893 

I I I I I I I I I 
Db 2780 CTCTGTAAC 2788 



RESULT 10 
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AAK90423 

ID AAK90423 standard; 
XX 

AC AAK9 0 423; 



DNA; 2804 BP. 



DT 05-NOV-2001 (first entry) 
XX 

DE Human digestive system antigen genomic sequence SEQ ID NO: 3999. 
XX 

KW Human; digestive system antigen; gene therapy; cancer; appendicitis; 

KW ulcerative colitis; infection; Hirschsprung's disease; chronic colitis; 

KW digestive system disorder; Meckel's diverticulum; ds . 



OS 
XX 



XX 
PD 



Homo sapiens . 



WO200155314-A2 . 



02-AUG-2001 . 



PF 
XX 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 



PR 
PR 



17-JAN-2001; 2001WO-US001324 . 



31-JAN-2000; 
04-FEB-2000; 
24-FEB-2000; 
02-MAR-2000; 

16- MAR-2 0 00; 

17- MAR-2 0 00; 



-APR-2 0 00; 
-MAY-2 0 00; 



0 7-JUN-2 0 00; 
28-JUN-2000; 
30-JUN-2000; 
07-JUL-2000; 
07-JUL-2000; 
ll-JUL-2000; 
ll-JUL-2000; 
14-JUL-2000; 
26-JUL-2 0 00; 
26-JUL-2 0 00; 
14-AUG-2 0 00; 
14-AUG-2000; 
14-AUG-2000; 
14-AUG-2000; 
14-AUG-2 0 00; 
14-AUG-2 0 00; 
14-AUG-2 0 00; 
14-AUG-2 0 00; 
14-AUG-2 0 00; 
14-AUG-2 0 00; 



2000US-0179065P. 
2000US-0180628P. 
2000US-0184664P. 
2000US-0186350P. 
2000US-0189874P. 
2000US-0190076P. 
2000US-0198123P. 
2000US-0205515P. 
2000US-0209467P. 
2000US-0214886P. 
2000US-0215135P. 
2000US-0216647P. 
2000US-0216880P. 
2000US-0217487P. 
2000US-0217496P. 
2000US-0218290P. 
2000US-0220963P. 
2000US-0220964P. 
2000US-0224518P. 
2000US-0224519P . 
2000US-0225213P . 
2000US-0225214P. 
2000US-0225266P. 
2000US-0225267P. 
2000US-0225268P. 
2000US-0225270P. 
2000US-0225447P. 
2000US-0225757P. 
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05 
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-2001; 
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-0259678P 



XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Barash SC, Ruben SM; 
XX 

DR WPI; 2001-502630/55. 

XX 

PT Polynucleotides encoding digestive system antigens, useful for 

PT diagnosing, treating, preventing and/or prognozing disorders of the 

PT digestive system, particularly cancer and cancer metastases . 

XX 

PS Disclosure; SEQ ID NO 3999; 986pp; English. 
XX 

CC The present invention provides the protein and coding sequences of a 

CC number of human digestive system antigens. These can be used in the 

CC diagnosis, treatment and prevention of digestive system disorders, 

CC including cancer, Meckel's diverticulum, bacterial or parasitic 

CC infections, appendicitis, Hirschsprung's disease, chronic colitis or 

CC ulcerative colitis. The present sequence is a genomic DNA fragment 

CC encoding a digestive system antigen of the invention 
XX 

SQ Sequence 2804 BP; 598 A; 777 C; 808 G; 621 T; 0 U; 0 Other; 



Query Match 61.3%; Score 547.4; DB 4; Length 2804; 

Best Local Similarity 99.8%; Pred. No. 1.2e-121; 

Matches 548; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 345 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 40 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 2 46 AGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTG 23 05 



Qy 405 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 46 4 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2306 AGGAGGTGTCCTGGGTTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCC 2365 



Qy 46 5 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 52 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 236 6 GGGGGCGTGGCTGCCTGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCC 2 425 
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Qy 52 5 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 584 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2426 GGGCCTCCGCTCAGAGAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAG 2485 

Qy 585 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 6 44 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2486 AACAGTGTTTCCCAGGCCCCGCTGAGTGGACCGGACCTCTGACACCTCCAGGTTCTTGCT 2545 

Qy 6 45 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 70 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2546 GACTCCGGCCTGGTGAAAGGGAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAGAGGC 26 05 

Qy 705 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTGCC 76 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 26 06 TCTCTTCTGGACAAACACACCCTCCCAGCCCCCAGGGCTGTGCAAACACATGCCCCTCCC 26 65 

Qy 76 5 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 82 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2666 ATAAGCACCAACAAGAACTTCTTGCAGGTGGAGTGGCTGTTTTTTATAAGTTGTTTTACA 2725 

Qy 82 5 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 884 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 726 GATACGGAAACAGTCCAAAATGGGATTTATAATTTCTTTTTTGCATTATAAATAAAGATC 2 785 

Qy 885 CTCTGTAAC 893 

I I I I I I I I I 
Db 2786 CTCTGTAAC 2794 



RESULT 11 
ABL65256 

ID ABL65256 standard; DNA; 574 BP. 
XX 

AC ABL65256; 
XX 

DT ll-JUN-2007 (revised) 

DT 15-MAY-2002 (first entry) 

XX 

DE Lung cancer related gene sequence SEQ ID NO: 3593. 
XX 

KW Human; cancer; colon; breast; ovary; oesophagus; kidney; thyroid; 

KW stomach; lung; prostate; pancreas; carcinoma; antitumour; cancerous; 

KW cytostatic; gene therapy; antineoplastic; Wilm's tumour; adenocarcinoma; 

KW gene; ds . 

XX 

OS Homo sapiens . 
XX 

PN WO200194629-A2 . 
XX 
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PA 
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XX 









2001WO-US010838 . 

2000US-0209473P. 
2000US-0209531P. 
2000US-0233133P. 
2000US-0233617P. 
2000US-0234009P. 
2000US-0234034P. 
2000US-0234052P. 
2000US-0234509P. 
2000US-0234567P. 
2000US-0234923P. 
2000US-0234924P. 
2000US-0235077P. 
2000US-0235082P. 
2000US-0235134P. 
2000US-0235280P. 
2000US-0235637P. 
2000US-0235638P. 
2000US-0235711P. 
2000US-0235720P. 
2000US-0235840P. 
2000US-0235863P. 
2000US-0236028P. 
2000US-0236032P. 
2000US-0236033P. 
2000US-0236034P. 
2000US-0236109P. 
2000US-0236111P. 
2000US-0236842P. 
2000US-0236891P. 
2000US-0237172P. 
2000US-0237173P. 
2000US-0237278P. 
2000US-0237294P. 
2000US-0237295P. 
2000US-0237316P. 
2000US-0237425P. 
2000US-0237598P. 
2000US-0237604P. 
2000US-0237606P. 
2000US-0237608P. 
2000US-0244867P. 
2000US-0245084P. 

>N PHARM. 
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PI Young PE, Augustus M, Carter KC, Ebner R, Endress G, Horrigan S; 

PI Soppet DR, Weaver Z; 

XX 

DR WPI; 2002-188264/24. 

DR PC:NCBI; gil318992. 

XX 

PT Screening for ant i-neoplast ic agent involves exposing cells to a chemical 

PT agent to be tested for ant i-neoplastic activity, and determining a change 

PT in expression of a gene of a signature gene set. 
XX 

PS Claim 1; SEQ ID NO 3593; 44pp; English. 

XX 

CC The present invention describes a method (Ml) for screening for an anti- 

CC neoplastic agent. The method involves exposing cells to a chemical agent 

CC to be tested for ant i-neoplast ic activity, determining a change in 

CC expression of at least one gene (I) of a signature gene set, where (I) 

CC comprises a sequence (S) selected from 8447 sequences (given in ABL61664 

CC to ABL70110), or is at least 95% identical to (S), where a change in 

CC expression is indicative of ant i-neoplastic activity. (I) has cytostatic 

CC activity and can be used in gene therapy. Ml can be used for screening an 

CC anti-neoplastic agent, and can be used for producing a product which is 

CC the data collected with respect to the anti-neoplastic agent as a result 

CC of Ml, and the data is sufficient to convey the chemical structure and/or 

CC properties of the agent. Ml can be used in the treatment of cancer such 

CC as colon, breast, stomach, lung, thyroid, oesophageal, ovarian, kidney, 

CC prostate or pancreatic cancer, adenocarcinoma, carcinoma, clear cell 

CC cancer, infiltrating ductal cancer, infiltrating lobular cancer, squamous 

CC cell carcinoma, neuroendocrine carcinoma, papillary carcinoma and Wilm's 

CC tumour 

CC 

CC Revised record issued on ll-JUN-2007 : Enhanced with precomputed 

CC information from BOND. 

XX 

SQ Sequence 574 BP; 140 A; 144 C; 158 G; 128 T; 0 U; 4 Other; 



Query Match 41.1%; Score 36 7; DB 6; Length 5 74; 

Best Local Similarity 88.8%; Pred. No. 2.5e-78; 

Matches 491; Conservative 0; Mismatches 42; Indels 20; Gaps 8; 

Qy 36 0 GACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGG 419 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 7 GACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCCTGAGGAGGTGTCCTGGG 6 6 

Qy 420 TTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCC 479 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 6 7 TTTGGCTGGCTGGCTCCTGCTCCAGCGGCCCGGCTTCAGGTGTCCGGGGGCGTGGCTGCC 126 

Qy 48 0 TGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGA 53 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 12 7 TGGAGCAGGTGTGCTGAATACCCTGGATGGGAACTGAGCGAACCCGGGCCTCCGCTCAGA 186 



Qy 540 GAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAG 599 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 18 7 GAGACGTGGCAGGACCAGCGAGGAATCCAGCCTGTCCACTTCCAGAACAGTGTTTCCCAN 2 46 

Qy 600 GCCCCGCTGAGTGGACCGGACCTCTGACACCTCC-AGGTTCTTGCTGACTCCGGCCTGGT 658 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 24 7 GCCCCGCTNAGTGGACCGGACCTCTGACACCTCCAAGGTTCTTGCTGACTCCGGCCTGGT 3 06 

Qy 65 9 GAAAGGG-AGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGA— GAGGCTCTCTTCT-GG 714 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 30 7 GAAAGGGAAGCGCCATGGTCCTGGCTGTTGGGGTCCCAGGGAAGAAGGCTCTCTTCTNGG 36 6 

Qy 715 ACAAACACACCCTCCCAGCCCCCAGGGCTGT GCAAACACATGCCCCTGCCATAAGCA 7 71 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III III I I I I I I I I 

Db 36 7 ACAAACACACCCTCCCAGCCCCCAGGGCTGTTGCAAACACATTGCCCCTTGCCATAAAGC 426 

Qy 772 CCAACAAGAACTTCTTGCAGGTGGAGTG GCTGTTTTTTATAAGTTGTTTTA 822 

III II III I I I I I I I I I I III I I I I I I I I 

Db 42 7 ACCAAACAAAGAACTTCTTTGCAGGGTGGAGTGGGCTGTTTTTTAATAAAGTTTGTTTTA 486 

Qy 823 CAGA- T AC GGAAAC AGT CCAAAATGGGATT T AT AAT TTCTTTTTT — GCATTATAAATAA 8 79 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 48 7 CAGATTACGGAAACAGTTCAAAATGGGATTTATAATTTCTTTTTTTGCATTAATAAATAA 5 46 

Qy 880 AGATCCTCTGTAA 892 

I I I I I I I I I I I I 
Db 547 AGATCCTCTGTTA 559 



RESULT 12 
AFS82561 

ID AFS82561 standard; DNA; 203 BP. 
XX 

AC AFS82561; 
XX 

DT 20-SEP-2007 (first entry) 
XX 

DE Human transcript sequence, SEQ ID 1960. 
XX 

KW DNA detection; RNA detection; exon; ds . 
XX 

OS Homo sapiens . 
XX 

PN WO200268579-A2 . 
XX 

PD 06-SEP-2002. 
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PF 10-JAN-2002; 2 0 02WO-US0 0 02 8 4 . 
XX 

PR 10-JAN-2001; 2 0 0 1US-0 0 75 6 6 9 6 . 
XX 

PA (PEKE ) PE CORP NY. 
XX 

PI Venter CJ, Adams M, Li PWD, Myers EW; 
XX 

DR WPI; 2002-682812/73. 
XX 

PT New isolated nucleic acid detection reagent for detecting the presence of 

PT specified human exons . 

XX 

PS Claim 4; SEQ ID NO 1960; 40pp; English. 

XX 

CC The present invention relates to a novel isolated nucleic acid detection 

CC reagent for detecting the presence of specified human exons. The exon 

CC seguences cover every identified human transcript and exon comprising 

CC every gene/coding region of the human genome. The present seguence is one 

CC such exon seguence. The nucleic acid detection agent is used for 

CC detecting the presence of at least 100000, at least 2000, at least 50000 

CC or at least 10000 human exons. The seguences that span exon-exon 

CC junctions eliminate false signals caused by genomic contamination. This 

CC is because a detection element comprising two neighboring exons as one 

CC contiguous seguence will not hybridize to genomic DNA comprising 

CC intervening intronic DNA. These detection elements will only hybridize to 

CC expressed mRNA transcripts in which the exons are connected and the 

CC intronic seguence has been removed, therefore forming one contiguous 

CC stretch of seguence corresponding to the seguence of the detection 

CC element that spans the exon-exon junction. 

XX 

SQ Seguence 203 BP; 49 A; 65 C; 60 G; 29 T; 0 U; 0 Other; 



Query Match 20.5%; Score 183; DB 7; Length 203; 

Best Local Similarity 100.0%; Pred. No. 5e-34; 

Matches 183; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 223 AGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCC 2 82 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 21 AGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCC 8 0 



Qy 2 83 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 3 42 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 81 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 140 



Qy 343 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 402 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 141 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 200 
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Qy 403 TGA 405 

I I I 

Db 201 TGA 203 



RESULT 13 
ADE07493 

ID ADE0 7493 standard; DNA; 1617 BP. 
XX 

AC ADE07493; 
XX 

DT 29-JAN-2004 (first entry) 
XX 

DE Novel coding sequence (useful for identifying genetic disorders) #559. 
XX 

KW novel gene; novel protein; tissue marker; molecular weight marker; 

KW chromosome marker; genetic disorder; gene; ds . 

XX 

OS Unidentified. 

XX 

PN WO2003054152-A2 . 
XX 

PD 03-JUL-2003. 
XX 

PF 10-DEC-2002; 2002WO-US039555 . 
XX 

PR 10-DEC-2001; 2001US-0339739P . 

PR ll-DEC-2001; 2001US-0339453P . 

PR 14-MAR-2002; 2002US-0365091P . 

PR 14-MAR-2002; 2002US-0365384P . 

PR 12-APR-2002; 2002US-0372381P . 

PR 12-APR-2002; 2002US-0372615P . 

PR 22-APR-2002; 2002US-00128558 . 

PR 24-APR-2002; 2 0 02US-0 3 76 0 45P . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Asundi V, Goodrich RW, Ren F, Zhang J, Zhao QA, Wang J; 

PI Ghosh M, Xue AJ, Wehrman T, Weng G, Zhou P, Drmanac RT, Wang Z; 

PI Ma Y, Wang D, Chen R, Xu C, Boyle BJ; 
XX 

DR WPI; 2003-569235/53. 

DR P-PSDB; ADE08404. 

XX 

PT New polynucleotides, useful for expressing recombinant proteins for 

PT analysis, characterization or therapeutic use, or as markers for tissues 

PT in which the corresponding protein is preferentially expressed. 

XX 

PS Claim 1; SEQ ID NO 559; 1177pp; English. 
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XX 

CC The invention comprises the amino acid and coding sequences of novel 

CC proteins. The DNA and protein sequences of the invention are useful as: 

CC markers for tissues in which the corresponding protein is preferentially 

CC expressed; as molecular weight markers on gels; as chromosome markers or 

CC tags; to identify chromosomes or to map related gene positions; and to 

CC compare with endogenous DNA sequences in patients to identify potential 

CC genetic disorders. The present DNA sequence represents a gene of the 

CC invention. 
XX 

SQ Sequence 1617 BP; 374 A; 484 C; 474 G; 285 T; 0 U; 0 Other; 



Query Match 20.5%; Score 183; DB 9; Length 1617; 

Best Local Similarity 100.0%; Pred. No. 9.8e-34; 

Matches 183; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 223 AGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCC 2 82 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1435 AGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCC 1494 



Qy 283 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 3 42 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1495 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 1554 



Qy 343 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 402 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1555 ACAGTCTGGCTGGATGAGACAGGGTCGTGCCCAGATGATGGAGAAATCGACCCAGAAGCC 1614 



Qy 

Db 



403 TGA 405 
I I I 

1615 TGA 1617 



RESULT 14 
ACH91110 

ID ACH91110 standard; DNA; 170 BP. 
XX 

AC ACH91110; 
XX 

DT 29-JUL-2004 (first entry) 
XX 

DE Human genome derived single exon probe #24305. 
XX 

KW Human; probe; ss; gene expression; single exon probe; microarray; 

KW alternative splicing event; genomic alteration. 

XX 

OS Homo sapiens . 
XX 

PN US2003194704-A1 . 
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XX 

PD 16-OCT-2003. 
XX 

PF 03-APR-2002; 2002US-00029386 . 
XX 

PR 03-APR-2002; 2002US-00029386 . 
XX 

PA (PENN/) PENN S G . 

PA (RANK/) RANK D R. 

PA (HANZ/) HANZEL D K. 
XX 

PI Penn SG, Rank DR, Hanzel DK; 
XX 

DR WPI; 2004-119264/12. 
XX 

PT New human genome-derived single exon nucleic acid probes useful for human 

PT gene expression analysis, for identifying or characterizing alternative 

PT splicing events, for assessing genomic alterations or as tools for 

PT surveying tissues. 
XX 

PS Claim 1; SEQ ID NO 24305; 80pp; English. 

XX 

CC The invention relates to a nucleic acid probe for measuring human gene 

CC expression, comprising any of the 27, 400 fully defined nucleotide 

CC sequences in the specification, or their complements or fragments, and 

CC encoding at least 8 amino acids of any of the 6888 amino acid sequences 

CC fully defined in the specification. The probe is a single exon probe that 

CC hybridises under high stringency conditions to a nucleic acid molecule 

CC expressed in human cells or tissues. Also included are a spatially- 

CC addressable set of single exon nucleic acid probes for measuring human 

CC gene expression (comprising a plurality of single exon nucleic acid 

CC probes cited above, where each of the plurality of probes is separately 

CC and addressably isolatable or amplif iable from the plurality) , a single 

CC exon microarray for measuring human gene expression, a method of 

CC measuring human gene expression, a vector comprising the single exon 

CC probe cited above, an ORF-encoded peptide comprising at least 8 

CC contiguous amino acids of any of the above- mentioned amino acid 

CC sequences (optionally with conservative amino acid substitutions), an 

CC isolated antibody that binds specifically to a peptide cited above, 

CC methods of selling and/or licensing single exon probes or microarrays to 

CC a customer desiring to measure gene expression, a method of providing 

CC human gene expression data by subscription, and a computer-readable 

CC storage medium which contains a database having a plurality of records 

CC (each record including data on the expression of a single exon probe 

CC cited above. The probe, methods and apparatus are useful in gene 

CC expression analysis. The probes may be used as tools for surveying 

CC tissues to detect the presence of expressed messages that contain their 

CC specific exon, or in constructing genome-derived single exon microarrays. 

CC In addition, the probes are used in identifying and characterising 
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CC alternative splicing events, in detecting and characterising gross 

CC alterations in the genomic locus that includes their exon, in assessing 

CC smaller genomic alterations, in priming the synthesis of nucleic acids, 

CC or in expressing the ORF-encoded peptide. The present sequence is a human 

CC single exon probe of the invention. Note: The sequence data for this 

CC patent did not form part of the printed specification, but was obtained 

CC in electronic format directly from USPTO at 

CC seqdata .uspto . gov/ sequence . html ?DocID=2 0 03 019 4 70 4 

XX 

SQ Sequence 170 BP; 37 A; 64 C; 47 G; 22 T; 0 U; 0 Other; 



Query Match 13.9%; Score 12 4; DB 12; Length 170; 

Best Local Similarity 100.0%; Pred. No. 8.3e-20; 

Matches 124; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 223 AGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCC 2 82 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 21 AGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCC 8 0 



Qy 2 83 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 3 42 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 81 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 140 



Qy 343 ACAG 346 

I I I I 

Db 141 ACAG 144 



RESULT 15 
ACH77410 

ID ACH77410 standard; DNA; 550 BP. 
XX 

AC ACH77410; 
XX 

DT 29-JUL-2004 (first entry) 
XX 

DE Human genome derived single exon probe #10605. 
XX 

KW Human; probe; ss; gene expression; single exon probe; microarray; 

KW alternative splicing event; genomic alteration. 

XX 

OS Homo sapiens . 
XX 

PN US2003194704-A1 . 
XX 

PD 16-OCT-2003. 

XX 

PF 03-APR-2002; 2002US-00029386 . 
XX 
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PR 03-APR-2002; 2002US-00029386 . 
XX 

PA (PENN/) PENN S G. 

PA (RANK/) RANK D R. 

PA (HANZ/) HANZEL D K. 
XX 

PI Penn SG, Rank DR, Hanzel DK; 
XX 

DR WPI; 2004-119264/12. 
XX 

PT New human genome-derived single exon nucleic acid probes useful for human 

PT gene expression analysis, for identifying or characterizing alternative 

PT splicing events, for assessing genomic alterations or as tools for 

PT surveying tissues . 
XX 

PS Claim 15; SEQ ID NO 10605; 80pp; English. 

XX 

CC The invention relates to a nucleic acid probe for measuring human gene 

CC expression, comprising any of the 27, 400 fully defined nucleotide 

CC sequences in the specification, or their complements or fragments, and 

CC encoding at least 8 amino acids of any of the 6888 amino acid sequences 

CC fully defined in the specification. The probe is a single exon probe that 

CC hybridises under high stringency conditions to a nucleic acid molecule 

CC expressed in human cells or tissues. Also included are a spatially- 

CC addressable set of single exon nucleic acid probes for measuring human 

CC gene expression (comprising a plurality of single exon nucleic acid 

CC probes cited above, where each of the plurality of probes is separately 

CC and addressably isolatable or amplif iable from the plurality) , a single 

CC exon microarray for measuring human gene expression, a method of 

CC measuring human gene expression, a vector comprising the single exon 

CC probe cited above, an ORF-encoded peptide comprising at least 8 

CC contiguous amino acids of any of the above- mentioned amino acid 

CC sequences (optionally with conservative amino acid substitutions), an 

CC isolated antibody that binds specifically to a peptide cited above, 

CC methods of selling and/or licensing single exon probes or microarrays to 

CC a customer desiring to measure gene expression, a method of providing 

CC human gene expression data by subscription, and a computer-readable 

CC storage medium which contains a database having a plurality of records 

CC (each record including data on the expression of a single exon probe 

CC cited above. The probe, methods and apparatus are useful in gene 

CC expression analysis. The probes may be used as tools for surveying 

CC tissues to detect the presence of expressed messages that contain their 

CC specific exon, or in constructing genome-derived single exon microarrays. 

CC In addition, the probes are used in identifying and characterising 

CC alternative splicing events, in detecting and characterising gross 

CC alterations in the genomic locus that includes their exon, in assessing 

CC smaller genomic alterations, in priming the synthesis of nucleic acids, 

CC or in expressing the ORF-encoded peptide. The present sequence is a human 

CC single exon probe of the invention. Note: The sequence data for this 
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CC patent did not form part of the printed specification, but was obtained 

CC in electronic format directly from USPTO at 

CC seqdata.uspto.gov/sequence . html?DocID=2 0 03 019 4 70 4 

XX 

SQ Sequence 550 BP; 92 A; 186 C; 174 G; 98 T; 0 U; 0 Other; 

Query Match 13.9%; Score 124; DB 12; Length 550; 

Best Local Similarity 100.0%; Pred. No. 1.2e-19; 

Matches 124; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 223 AGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCC 2 82 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 12 2 AGCAGCAGCAGCAGCCACGACGAGGCCCCCGTCCTGAACGACAAGCACCTGGACGTGCCC 181 

Qy 2 83 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 3 42 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 182 GACATCATCATCACGCCCCCCACCCCCACGGGCATGATGCTGCCGAGGGACTTGGGGAGC 2 41 

Qy 343 ACAG 346 

I I I I 

Db 242 ACAG 245 
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